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Abstract. In the online labeling problem with parameters n and m we 
are presented with a sequence of n keys from a totally ordered universe U 
and must assign each arriving key a label from the label set {1, 2, . . . , m} 
so that the order of labels (strictly) respects the ordering on U. As new 
keys arrive it may be necessary to change the labels of some items; such 
changes may be done at any time at unit cost for each change. The goal 
is to minimize the total cost. An alternative formulation of this problem 
is the file maintenance problem, in which the items, instead of being 
labeled, are maintained in sorted order in an array of length m, and we 
pay unit cost for moving an item. 

For the case m = cn for constant c > 1, there are known algorithms 
that use at most 0(n log(n) 2 ) relabelings in total [10], and it was shown 
recently that this is asymptotically optimal [2]. For the case of m = 
0(n c ) for C > 1, algorithms are known that use O(nlogn) relabelings. 
A matching lower bound was claimed in [8]. That proof involved two 
distinct steps: a lower bound for a problem they call prefix bucketing 
and a reduction from prefix bucketing to online labeling. The reduction 
seems to be incorrect, leaving a (seemingly significant) gap in the proof. 
In this paper we close the gap by presenting a correct reduction to prefix 
bucketing. Furthermore we give a simplified and improved analysis of the 
prefix bucketing lower bound. This improvement allows us to extend the 
lower bounds for online labeling to the case where the number m of labels 
is superpolynomial in n. In particular, for superpolynomial m we get an 
asymptotically optimal lower bound f2((n log n)/(log log m — log log n)). 
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1 Introduction 



In the online labeling problem with parameters n, m, r, we are presented with a 
sequence of n keys from a totally ordered universe U of size r and must assign 
each arriving key a label from the label set {1,2,..., m} so that the order of labels 
(strictly) respects the ordering on U. As new keys arrive it may be necessary 
to change the labels of some items; such changes may be done at any time at 
unit cost for each change. The goal is to minimize the total cost. An alternative 
formulation of this problem is the file maintenance problem, in which the items, 
instead of being labeled, are maintained in sorted order in an array of length m, 
and we pay unit cost for moving an item. 

The problem, which was introduced by Itai, Konheim and Rodeh [10] . is 
natural and intuitively appealing, and has had applications to the design of data 
structures (see for example the discussion in [5], and the more recent work on 
cache-oblivious data structures |4I6I5] ). A connection between this problem and 
distributed resource allocation was recently shown by Emek and Korman [9] . 

The parameter m, the label space must be at least the number of items n or 
else no valid labeling is possible. There are two natural range of parameters that 
have received the most attention. In the case of linearly many labels we have 
m = cn for some c > 1, and in the case of polynomially many labels we have 
m = 9{n G ) for some constant C > 1. The problem is trivial if the universe U is a 
set of size at most m, since then we can simply fix an order preserving bijection 
from U to {1, ... , m} in advance. In this paper we will usually restrict attention 
to the case that U is a totally ordered set of size at least exponential in n (as is 
typical in the literature). 

Itai et al. [TU] gave an algorithm for the case of linearly many labels having 
worst case total cost 0(n\og(n) 2 ). Improvements and simplifications were given 
by Willard [TT] and Bender et al. In the special case that m = n, algorithms 
with cost 0(log(n) 3 ) per item were given |12l7j . It is also well known that the 
algorithm of Itai et al. can be adapted to give total cost 0(n log(n)) in the case 
of polynomially many labels. All of these algorithms make no restriction on the 
size of universe U of keys. 

For lower bounds, a subset of the present authors recently proved [2] a tight 
i?(nlog(n) 2 ) lower bound for the case of linearly many labels and tight bound 
f2(n\og(n) 3 ) for the case m — n. These bounds hold even when the size of the 
universe U is only a constant multiple of m. The bound remains non-trivial (su- 
perlinear in n) for m = 0(n log(n) 2 ~ e ) but becomes trivial for m £ f2(n log(n) 2 ). 

For the case of polynomially many labels, Dietz at al. [5] (also in (T2]) claim 
a matching lower bound for the 0(nlog(n)) upper bound. Their result consists 
of two parts; a lower bound for a problem they call prefix bucketing and a reduc- 
tion from prefix bucketing to online labeling. However, the argument giving the 
reduction seems to be incorrect, and we recently raised our concerns with one of 
the authors (Seiferas), who agrees that there is a gap in the proof. 

This paper makes the following contributions: 



— We provide a correct reduction from prefix bucketing to online labeling, 
which closes the gap in the lower bound proof for online labeling for the case 
of polynomially many labels. 

— We provide a simpler and more precise lower bound for prefix bucketing 
which allows us to extend the lower bounds for online labeling to the case 
where the label size is as large as 2™ . Specifically we prove a lower bound 
of f2((n log n) /(log log m — log log n)) that is valid for m between n 1+£ and 
2™ . Note that for polynomially many labels this reduces to J?(nlog(n)). 

We remark that, unlike the bounds of [2] for the case of linearly many labels, 
our lower bound proof requires that the universe U is at least exponential in n. 
It is an interesting question whether one could design a better online labeling 
algorithm for U of size say mlogn. We summarize known results in Table [TJ 
All the results are for deterministic algorithms. There are no known results for 
randomized algorithms except for what is implied by the deterministic case. 



Table 1. Summary of known bounds for the online labeling problem. 



Array size (m) Lower bound 
m — n J?(nlog(n) 3 ) 
m — 0(n),m>n Q(n log(n) 2 ) 


Upper bound 
g] 0(nlog(n) 3 ) QU 
[2] Q(n log(n) 2 ) [10] 


m = n i+o(i) Q{ 


( nlog(n)^ 
I log m — log 


y 


1 [JJ Ol 


( nlog(ra)' 
i log m — log 




m = n i+©(i) J?(nlog(n)) 




[this paper] 0(nlog(n)) 


m 


m = n «(log(n)) Q{ 


f n log n \ 
\ log log m J 


! [this paper] 0\ 


f n log n } 
\ log log m j 





Our proof follows the high level structure of the proof from j8]. In the re- 
mainder of the introduction we sketch the two parts, and relate our proof to the 
one in [5]. 

1.1 Reducing Online Labeling to Prefix Bucketing 

Dietz et al. [S] sketched a reduction from online labeling to prefix bucketing. 
In their reduction they describe an adversary for the labeling problem. They 
show that given any algorithm for online labeling, the behavior of the algorithm 
against the adversary can be used to construct a strategy for prefix bucketing. 
If one can show that the cost of the derived bucketing strategy is no more than 
a constant times the cost paid by the algorithm for relabelings then a lower 
bound on bucketing will give a similar lower bound on the cost of any relabeling 
algorithm. Unfortunately, their proof sketch does not show this. In particular, a 
single relabeling step may correspond to a bucketing step whose cost is i7(log(n)), 
and this undermines the reduction. This may happen when inserting 6>(logn) 
items into an empty segment of size n e without triggering any relabelings. We 



construct a different adversary for which one gets the needed correspondence 
between relabeling cost and bucketing steps. 

Our goal in constructing an adversary is to force any online algorithm to 
perform many relabelings during insertion on n keys. The adversary is described 
in detail in Section [5] here we provide a high level description. 

The adversary starts by inserting the minimal and maximal element of U, i.e. 
1 and r, and five other keys uniformly spread in U. From then on the adversary 
will always pick a suitable pair of existing consecutive keys and the next inserted 
key will be the average of the pair. Provided that r > 2™ there will always be an 
unused key between the pair. 

It is illuminating to think of the problem in terms of the file maintenance 
problem mentioned earlier. In this reformulation we associate to the label space 
[m] an array indexed by {1, . . . ,m} and think of a key labeled by j as being 
stored in location j. Intuitively, the adversary wants to choose two consecutive 
keys that appear in a crowded region of this array. By doing this repeatedly, 
the adversary hopes to force the algorithm to move many keys within the array 
(which corresponds to relabeling them). The problem is to make precise the 
notion of "crowdedness" . Crowding within the array occurs at different scales 
(so a small crowded region may lie inside a large uncrowded region) and we need 
to find a pair of consecutive keys with the property that all regions containing 
the pair are somewhat crowded. 

With the array picture in mind, we call an interval of labels a segment, and 
say that a label is occupied if there is a key assigned to it. The density of a 
segment is the fraction of occupied labels. 

In [8], the authors show that there is always a dense point, which is a point 
with the property that every segment containing it has density at least half the 
overall density of the label space. They use this as the basis for their adversary, 
but this adversary does not seem to be adequate. 

We design a different adversary (which is related to the adversary constructed 
in [5] to handle the case of linearly many labels). The adversary maintains a se- 
quence (hierarchy) of nested segments. Each successive segment in the hierarchy 
has size at most half the previous segment, and its density is within a constant 
factor of the density of the previous segment. The hierarchy ends with a segment 
having between 2 and 7 keys. The key chosen to be inserted next is the average 
(rounded down) of the two "middle" keys of the smallest segment. 

For the first 8 insertions, the hierarchy consists only of the single segment 
[m]. After each subsequent insertion, the algorithm A specifies the label of the 
next key and (possibly) relabels some keys. The adversary then updates its 
hierarchy. For the hierarchy prior to the insertion, define the critical segment 
to be the smallest segment of the hierarchy that contains the label assigned to 
the inserted key and the old and new labels of all keys that were relabeled. 
The new hierarchy agrees with the previous hierarchy up to and including the 
critical segment. Beginning from the critical segment the hierarchy is extended 
as follows. Having chosen segment S for the hierarchy, define its left buffer to be 
the smallest subsegment of S that starts at the minimum label of S and includes 



at least 1/8 of the occupied labels of S, and its right buffer to be the smallest 
subsegment that ends at the maximum label of S and includes at least 1/8 of 
the occupied keys of S. Let S' be the segment obtained from S by deleting the 
left and right buffers. The successor segment of S in the hierarchy is the shortest 
subsegment of S' that contains exactly half (rounded down) of the occupied 
labels of S'. The hierarchy ends when we reach a segment with at most seven 
occupied labels; such a segment necessarily has at least two occupied labels. 

It remains to prove that the algorithm will make lot of relabels on the se- 
quence of keys produced by the adversary. For that proof we need a game intro- 
duced by Dietz et al. [5] that they call prefix bucketing. 

A prefix bucketing of n items into k buckets (numbered 1 to k) is a one 
player game consisting of n steps. At the beginning of the game all the buckets 
are empty. In each step a new item arrives and the player selects an index p € [k] . 
The new item as well as all items in buckets p+ 1, . . . , k are moved into bucket p 
at a cost equal to the total number of items in bucket p after the move. The goal 
is to minimize the total cost of n steps of the game. Notice that suffix bucketing 
would be a more appropriate name for our variant of the game, however we keep 
the original name as the games are equivalent. 

We will show that if A is any algorithm for online labeling and we run A 
against our adversary then the behavior of A corresponds to a prefix bucketing 
of n items into k — [logm] buckets. The total cost of the prefix bucketing will 
be within a constant factor of the total number of relabelings performed by the 
online labeling algorithm. Hence, a lower bound on the cost of a prefix bucketing 
of n items into k buckets will imply a lower bound on the cost of the algorithm 
against our adversary. 

Given the execution of A against our adversary we create the following pre- 
fix bucketing. We maintain a level for each key inserted by the adversary; one 
invariant these levels will satisfy is that for each segment in the hierarchy, the 
level of the keys inside the segment are at least the depth of the segment in the 
hierarchy. The level of a newly inserted key is initially k. After inserting the key, 
the algorithm does its relabeling, which determines the critical segment and the 
critical level. All keys in the current critical segment whose level exceeds the 
critical level have their levels reset to the current critical level. 

The assignment of levels to keys corresponds to a bucketing strategy, where 
the level of a key is the bucket it belongs to. Hence, if p is the critical level, all 
the keys from buckets p,p + 1, . . . , k will be merged into the bucket p. 

We need to show that the cost of the merge operation corresponds to the 
number of relabelings done by the online algorithm at a given time step. For this 
we make the assumption (which can be shown to hold without loss of generality) 
that the algorithm is lazy, which means that at each time step the set of keys 
that are relabeled is a contiguous block of keys that includes the newly inserted 
keys. The cost of the bucketing merge step is at most the number of keys in the 
critical segment. One can argue that for each successor segment of the critical 
segment, either all labels in its left buffer or all labels in its right buffer were 



reassigned, and the total number of such keys is a constant fraction of the keys 
in the critical segment. 



1.2 An Improved Analysis of Bucketing 

It then remains to give a lower bound for the cost of prefix bucketing. This was 
previously given by Dietz et al. [5] for fc G 6>(logn). We give a different and 
simpler proof that gives asymptotically optimal bound for fc between log n and 
0(n e ). We define a family of trees called fc-admissible trees and show that the 
cost of bucketing for n and fc, is between dn/2 and dn where d is the minimum 
depth of a fc-admissible tree on n vertices. We further show that the minimum 
depth of a fc-admissible tree on n vertices is equal gk(n) which is defined to 
be the smallest d such that ( > n. This gives a characterization of the 

optimal cost of prefix bucketing (within a factor of 2). When we apply this 
characterization we need to use estimates of gk(n) in terms of more familiar 
functions (Lemma ITU . and there is some loss in these estimates. 



2 The Online Labeling Problem 

In this paper, interval notation is used for sets of consecutive integers, e.g., [a, b] 
is the set {k E Z : a < k < b}. Let m and r > 1 be integers. We assume without 
loss of generality U = [r]. An online labeling algorithm A with range m is an 
algorithm that on input sequence y\ y 2 , . . . , y* of distinct elements from U gives 
an allocation f : {y , y 2 , . . . , y*} — > [m] that respects the natural ordering of 
y 1 , . . . , y* that is for any x, y £ {y 1 , y 2 , . . . , y*}, f(x) < f(y) if and only if x < y. 
We refer to y x , y 2 , . . . , y* as keys. The trace of A on a sequence y 1 , y 2 , . . . , y™ G U 
is the sequence f° , f 1 , f 2 , . . . , /" of functions such that /° is the empty map- 
ping and for i — 1, . . . , n, p is the output of A on y 1 , y 2 , . . . , y\ For the trace 
/°i J 1 ; Pi •••)/" an d t = 1, . . . , n, we say that A relocates y e {y 1 , y 2 , . . . , y*} 
at time t if f t ~ 1 (y) ^ f f (y)- So each y* is relocated at time t. For the trace 
/°i J 1 ; / 2 ; • ■ • j /" an d t = 1, ... ,7i, Rel* denotes the set of relocated keys at step 
t. The cost of A incurred on y\y 2 , . ..,y n G U is X^(?/l ; ■ • ■ ,2/n) = E"=o 1-^1 
where i?eZ is measured with respect to the trace of A on y 1 , y 2 , . . . , y™. The 
maximum cost x^i(y 1 , . . . , y") over all sequences y 1 , . . . , y™ is denoted X^l( n )- 
We write Xm{n) for the smallest cost XA( n ) that can be achieved by any algo- 
rithm A with range m. 



2.1 The Main Theorem 

In this section, we state our lower bound results for Xmin). 

Theorem 1. There are positive constants C$,C\ and Ci so that the following 
holds. Let m, n be integers satisfying Co < n < m < 2™ 1 . Let the size of U be 
more than 2"+ 4 . Then Xm (n) > C 2 ■ 3+logl " g l ^ loglogn - 

To prove the theorem for given algorithm A we will adversarially construct a 
sequence y , y 2 , . . . , y* of keys that will cause the algorithm to incur the desired 
cost. In the next section we will design the adversary. 



2.2 Adversary Construction 



Any interval [a, b] C [m] is called a segment. Fix n, m, r > 1 such that m > n 
and r > 2"+ 4 . Fix some online labeling algorithm A with range m. To pick the 
sequence of keys y , . . . ,y n , the adversary will maintain a sequence of nested 
segments 5'depth(t) — ' ' ' — ^2 — &l = [ m \ > updating them after each time step 
i. The adversary will choose the next element y* to fall between the keys in the 
smallest interval S^ epth , t y In what follows, /' is the allocation of y , . . . , y* by 
the algorithm A. 

The population of a segment S at time t is pop^S 1 ) = (f t )~ 1 (S) and the 
weight of S at time t is weight* (S) = |pop*(5)|. For t — 0, we extend the 
definition by pop°(S') = and weight (S) = 0. The density of S at time t 
is p t (S) = weight* (S)/\S\. For a positive integer b, let densify* (S*, b) be the 
smallest subsegment T of S of weight exactly [(weight* (S*) — 26) / 2J such that 
pop*(T) does not contain any of the b largest and smallest elements of pop*(S l ). 
Hence, densify t (S', b) is the densest subsegment of S that contains the appro- 
priate number of items but which is surrounded by a large population of S 
on either side. If pop t (S) = {xi < x-i < ■ ■ ■ < xi} then midpoint* (S) = 

\(x[(£-l)/2] + X[(£+l)/2])/2}- 

Let y 1 , y 2 , . . . , y* be the first t keys inserted and let Rel 1 be the keys that are 
relabeled by A in response to the insertion of y*. The busy segment B C [m] at 
time t is the smallest segment that contains f t (Rel t ) U f t ~ 1 (Rel t \ {y*})- We say 
that the algorithm A is lazy if all the keys that are mapped to B* are relocated 
at step t, i.e., (f t )~ 1 (B t ) = ReV 1 . By Proposition 4 in [2], when bounding the 
cost of A from below we may assume that A is lazy. 

Adversary(_4, n, m, r) 

Set p° = 0. 

For t = l,2,...,n do 

- If t < 8, let Si = [m], b\ = 1, depth(<) = 1, and p* = 1. Set y* = 1 + \(t- 
1) • (r — l)/6] , and run A on y 1 , y 2 , . . . , y* to get /'. Continue to next t. 

- Preservation Rule: For i = 1, . . . ,p t_1 , let 5* = S**" 1 and 6* = fr^ 1 . (For 
t > 8, copy the corresponding segments from the previous time step.) 

- Rebuilding Rule: 
Set i =p'- 1 + 1. 

While weight*" 1 (Sj^) > 8 

• Set 5* = density*- 1 (S t i _ 1 ,b t i _ 1 ). 

• Set b\ = [weight*" 1 (5*)/8l. 

• Increase i by one. 

- Set depth(t) = i - 1. 

- Set y* = midpoint*" 1 (5'depth(t))' 

- The critical level: Run A on y 1 , y 2 , . . . , y* to get /*. Calculate i?el* and B l . 
Set the critical level p* to be the largest integer j E [depth(i)] such that 
/>" C S**. 



Output: y , y 2 , . . . , y n . 

We make the following claim about the adversary which implies Theorem [TJ 
We did not attempt to optimize the constants. 

Lemma 1. Let m,n,r be integers such that and 2 32 < n < m < 2 ^™/ 8 and 
2'n+i ^ r ^ j^ e ^ j[ f) e a i aZ y online labeling algorithm with the range m. Let 
y 1 ,y 2 , . . . ,y n be the output of Adversary {A, n, m, r). Then the cost 

XA (y\y 2 ,...,y n ) > — nl °§ n 2. 

512 3 + log [log m\ — log log n 6 

Notice, if r > 2"+ 4 then for any t G [n — 1] the smallest pair-wise difference 
between integers y 1 , y 2 , . . . , y l is at least 2 n+1 ~* so y t+1 chosen by the adversary 
is different from all the previous y's. All our analysis will assume this. 

To prove the lemma we will design a so called prefix bucketing game from 
the interaction between the adversary and the algorithm, we will relate the cost 
of the prefix bucketing to the cost XAiy 1 ^ 2 , • • • > y n ), and we will lower bound 
the cost of the prefix bucketing. 

In preparation for this, we prove several useful properties of the adversary. 

Lemma 2. For any t € [n], depth(i) < logm. 

Proof. The lemma is immediate for t < 8. For t > 8, it suffices to show that the 
hierarchy S{,Si,,... satisfies that for each i E [1, depth(t) - 1], \Sj +1 \ < 15*1/2. 
Recall that Sj is obtained from S\_ 1 by removing the left and right buffer to 
get a subsegment S' and then taking Sj +1 to be the shortest subsegment of 5" 
that contains exactly half of the keys (rounded down) labeled in S". Letting L' 
be the smallest |_|S"|/2J labels of S' and R' be the largest [\S'\/2\ labels of S', 
one of V and R' contains at least half of the occupied labels (rounded down) in 
S', which implies \Sj +1 \ < \S'\/2 < \S\\/2. ^ □ 

Lemma 3. For any t £ [n] and i G [depth(t) - 1], 64 ■ b\ +1 > weight*" 1 ^) - 
weight*" 1 ^). 

Proof. For t < 8 the lemma is true trivially so we assume that t > 8. For any 
integers s,s' such that start(£, i) < s < s' < end(t, i), 

weight 5 " 1 (5|) - weigbt i,-1 (S5) + (s' - s). 

Let s = start (f, i + 1). Then start(t, i) < s < t < end(t, < end(t, i) so 

weight t_1 (S5) - weight'" 1 (S** +1 ) = weight 5 - 1 ^*) - weight 5 " 1 ^!) 

= weight'- 1 ^") - weight 5 -^^^!). 

Also 



b\ +x = bf +1 = [weight 5 - 1 (Sf +1 )/81 > weight 8 " 1 (S s i+1 )/8. 



Since 8 < weight 8 " 1 (Sj) and weight start( *^ )_1 (5*) < weight 8 " 1 (Sj) 



< weight 8 " 1 {St) /A. 



Hence, 



weight 8 " 1 ^) = L(weight 8 - 1 (5 8 )-2& 8 )/2j > [weight*" 1 (Sf)/4j 

> weight 8 " 1 (5 8 )/8. 

Thus, b\ +1 > weight 8 " 1 (S?)/64 > (weight s_1 (S?)-weight s_1 (S? +1 ))/64. The 
claim follows. □ 

Corollary 1. For any t e [n] and i e [depth(i) - 1], 8 + 64 • Ej=^+i (t) h ) > 
weight'" 1 ^). 

Lemma 4. If A is lazy then for any t € [n], \Rel t \ > Y^t=pt+? H- 

Proof. If p t — depth(i) then the lemma is trivial. If p t = depth(t) — 1 then 
the lemma is trivial as well since \Rel f \ > 1 and & depth ( t ) = 1 always. So let us 
assume that p l < depth(i) — 1. By the definition of p l we know that at least one 
of the following must happen: /*" 1 (min(i?e/*)) <G S pt \ S t pt+1 , /*(min(i?d*)) e 
S pt \ S£ t+1 , /*- 1 (max( J R e /*)) e S<< \ S< t+1 or /*(max(i*d*)) e 5*. \ S* t+1 . 
Assume that /^(min^eZ*)) e S** t \ S** t+1 or /*(min(i 1 'e/ t )) e S** t \ S** t+1 , the 
other case is symmetric. Let Y = {y e {y 1 , y 2 , . . . , y* -1 }; min(i?e/*) < y < y*}. 
Since .4 is lazy, Y C iieZ*. S* t+ i \ 'S'depth(t) ^ s * ne UIUOn of two subsegments, 
the left one Sl and the right one Sr. The population of Sl at time t — 1 
must be contained in Y\ For any i e [depth(i) — 1], the population of the left 
subsegment of Sj \ Sj +1 at time t — 1 is at least b\, by the definition of Sj +1 . 

Hence, Ef^+i^ 1 ^ < Ipop'" 1 ^)! < \Y\ < \Rel% Since 6 depth(t) = 1, the 
lemma follows. □ 



Corollary 2. Let A be a lazy algorithm. Then 64 • x^y 1 , y 2 , . . . , y n ) + 8n > 
E?=i weight*" 1 ^)- 



3 Prefix Bucketing 

A prefix bucketing of n items into k buckets is a sequence a , a 1 , . . . , a" € N fe of 
bucket configurations satisfying: a = (0,0,..., 0) and for t = 1, 2, . . . , n, there 
exists p l G [fc] such that 

1. a \ = a*" 1 , for all i = 1, 2, . . . ,p t - 1, 

2. a* t = 1 + E l > pt «r 1 , and 

3. a* = 0, for alU = p* + 1, . . . , k. 



The cost of the bucketing a , a 1 ,..., a" is c(a°,a 1 , . . . ,a n ) = X)"=i a p*- I n 
Section l3.1l we prove the following lemma. 

Lemma 5. Let n > 2 32 and k be integers where logn < k < The cost of 

any prefix bucketing of n items into k buckets is greater than 8(i og 8fc-k)giogn) ~ n - 

We want to relate the cost of online labeling to some prefix bucketing. We 
will build a specific prefix bucketing as follows. Set k — [logm]. For a lazy 
online labeling algorithm A and t = 1, . . . , n, let /*, Sj, B t ,p t ,y t and f°,p° be 
as defined by the Adversary(„4, n, m, r) and the algorithm A. Denote Y = 
{y 1 , y 2 , . . . , y n }. For t — 0, 1, . . . , n and I = 1, . . . , k, define a sequence of sets 
A\ C Y as follows: for alH = 1, . . . , k, A? = 0, and for t > 0: 

- A\ = A* -1 , for alU=l,...,p*-l, 
-4=M U U> P - A*- 1 , and 

- A\ = 0, for alH=p* + 

The following lemma relates the cost of online labeling to a prefix bucketing. 

Lemma 6. Let the prefix bucketing a , a , . . . , a™ &e defined by a\ — \A\\, for all 
t = 0, . . . , n and i = 1, . . . , k. The cost of the bucketing a , a , . . . , a" is at most 

64-XA(y\y\--.,y n ) + $n- 

Lemmas [5] and [5] together imply Lemma [T] The following lemma will be used 
to prove Lemma [6] 

Lemma 7. For any t e [n] and i € [depth(i)], if i ^ p* then f t ^ 1 (A\) C 5- 
otherwise ^(Aj \ {y*}) C Sj. 

Proof. We prove the claim by induction on t. For t = 1, the only non-empty set 
is A 2 = {y 1 } so the claim is true. Let assume that it is true for t — 1 and we 
prove it for t. The Adversary(„4, n, m, r) produces the sets S 1 * and y*, and then 
the algorithm A outputs /*. Based on it the adversary defines E 1 ,Rel l and p*. 
We distinguish two cases. 

Case p 1 " 1 < p t : For all i < p i_1 , A\ = A^ 1 , and for all i > p t_1 , if i ^ p l 
then A\ = otherwise A* = {y*}. For all i < p*" 1 , i? t_1 C S*7-i C S.^ 1 = Sj, 
where the first containment follows from the definition of and the last 
equality follows from the definition of Sj. For all i < y* _1 ^ A^ 1 — A* 

so using the induction hypothesis / t_1 (^*) £ / t_2 (^) U C S 1 *" 1 = Sf. 

Similarly for i = p*" 1 , Z* -1 ^) C /*~ 2 (A* \ {y*" 1 }) U B^ 1 C S 1 *" 1 = S 1 *. For 
each i > either A* = or i = p* and A* = {y*}. In either case the lemma 

follows trivially. 

Case > p* is similar: For all i < p*, A* = A- -1 , for i = p*, A* = 

{v*} U U^>p* and for i > p\ A\ = 0. Again, for all * < p\ B*' 1 C S*" i C 

S 1 * -1 = 5*. For all i < p 4 , y t_1 ^ A* -1 = A* so using the induction hypothesis 
f- l (Al) C /^ 2 (A*) U B*- 1 C S 1 ^ 1 = S|. It only remains to consider the case 
of i — p l as the claim is trivial for i > p* . Since p 1 ^ 1 > p*, for i > p*, S 1 * -1 C 



5* t . Using the induction hypothesis, Z'" 1 ^ \ {y*}) C /*- 1 (U i > p * 4"') ^ 
Ui>„« / t_2 (4 _1 \ {y^ 1 }) U B*- 1 C The lemma follows. □ 
Proof of Lemma® Using the previous lemma we see that for t £ [n], |j4* t | < 
weight* - 1 (5* t ) + 1. The lemma follows by Corollary[2] □ 

3.1 Lower Bound for Bucketing 

In this section we will prove Lemma [5j To do so we will associate with each 
prefix bucketing a fc-tuple of ordered rooted trees. We prove a lower bound on 
the sum of depths of the nodes of the trees, and this will imply a lower bound 
for the cost of the bucketing. 

An ordered rooted tree is a rooted tree where the children of each node are 
ordered from left to right. Since these are the only trees we consider, we refer to 
them simply as trees. The leftmost principal subtree of a tree T is the subtree 
rooted in the leftmost child of the root of T, the i-th principal subtree ofT is the 
tree rooted in the i-th child of the root from the left. If the root has less than 

1 children, we consider the i-th principal subtree to be empty. The number of 
nodes of T is called its size and is denoted \T\. The depth of a node is one more 
than its distance to the root, i.e., the root has depth 1. The cost of T, denoted 
c(T), is the sum of depths of its nodes. The cost and size of an empty tree is 
defined to be zero. 

We will be interested in trees that satisfy the following condition. 

Definition 1 (fc-admissible). Let k be a non-negative integer. A tree T is k- 
admissible if it contains at most one vertex or 

— its leftmost principal subtree is k-admissible and 

— the tree created by removing the leftmost principal subtree from T is (k — 1)- 
admissible. 

Notice that when a tree is fc-admissible it is also fc'-admissible, for any k! > k. 
The following easy lemma gives an alternative characterization of fc-admissible 
trees: 

Lemma 8. A rooted ordered tree T is k-admissible if and only if for each i £ [fc], 
the i-th principal subtree ofTis{k — i+ 1)- admissible. 

Proof. We show both directions at once by induction on fc. For a single- vertex 
tree the statement holds. Let T be a tree with at least 2 vertices and let L be 
its leftmost principal subtree and T" be the subtree of T obtained by removing 
L. By definition T is fc-admissible if and only if L is fc-admissible and T' is fc — 1 
admissible, and by induction on fc, T' is fc — 1 admissible if and only if for each 

2 < i < k the i-th principal subtree of T, which is the (i — l)-st principal subtree 
of T 1 is (k - i + Inadmissible. □ 

We will assign a fc-tuple of trees T(a)i, T(a)2, • ■ • , T{a)k to each prefix buck- 
eting a = a , a , . . . , a*. The assignment is defined inductively as follows. The 
bucketing a — a gets assigned the fc-tuple of empty trees. For bucketing a = 



a , a 1 , . . . , a* we assign the trees as follows. Let p t be as in the definition of prefix 
bucketing, so 7^ a' pt 7^ a*7 an d for all i > p*, a\ = 0. Let a' = a , a 1 , . . . , a 4-1 . 
Then we let T(a)i = T(a')i, for 1 < i < p*, and T(a)i be the empty tree, for 
p* < i < k. The tree T(a) p t consists of a root node whose children are the 
non-empty trees among T(a') p t ,T(a') p t + i, . . . ,T(a')k ordered left to right by 
the increasing index. 

We make several simple observations about the trees assigned to a bucketing. 

Proposition 1. For any positive integer k, if a = a? , a 1 , . . . , a* is a prefix buck- 
eting into k buckets then for each i£ [k], \T(a)i\ = a*. 

The proposition follows by a simple induction on t. 

Proposition 2. For any positive integer k, if a = a°, a 1 , . . . , a* is a prefix buck- 
eting into k buckets then for each i £ [k], T(a)i is (k + 1 — i) -admissible. 

Again this proposition follows by induction on t and the definition of k- 
admissibility. The next lemma relates the cost of bucketing to the cost of its 
associated trees. 

Lemma 9. For any positive integer k, if a = a P,a , , . . . , a* is a prefix bucketing 
into k buckets then X)<=i c (^(^)i) = c(a). 

Proof. By induction on t. For t = the claim is trivial. Assume that the claim 
is true for t — 1 and we will prove it for t. Let a' = a°, a 1 , ... , a* -1 and p* be as 
in the definition of prefix bucketing. 

c(a) = c(a') + 1 + E a ^ = E + 1 + E 

i—p l i—1 i—p t 

p* - 1 fc 

= e ccr(o)i) + 1 + e ( c (n«')* + iT(a')ii) 

i—1 i—p t 

where the first equality follows by the induction hypothesis and by Proposition 
[TJ and the last equality follows by the definition of T(a)i, for i = 1, ...,£>*— 1. 
For i > p , the depth of each node in T(a')i increases by one when it becomes 
the child of T(a) p t hence 

k 

c(T(a) pt ) = l + J2(c(T(a') t + \T(a') t \). 

i—p t 

For i > p*, c(T(a)i) — so the lemma follows. □ 
Now, we lower bound the cost of any ordered rooted tree. 

Lemma 10. Let k,d > 1 and T be a k-admissible tree of depth d. Then c(T) > 
d-\T\/2 and\T\< ( k+ ^- 1 ). 



Proof. We prove the lemma by induction on k + d. Assume first that k = 1 and 
d > 1. The only 1-admissible tree T of depth d is a path of d vertices. Hence, 
\T\=d= C+f 1 ) andc(T) = £?=ii = d-_(d+l)/2. ^ 

Now assume that fc > 1 and that T is fc-admissible. Denote the leftmost 
principal subtree of T by L and the tree created by removing L from T by 
R. By the induction hypothesis and definition of /c-admissibility it follows that 
\T\ = \L\ + \R\ < { k+d - 2 ) + { k+ k d _- 2 ) = ( fc+ ^ 1 ). Furthermore, c(T) = c{L) + 
\L\ + c(R) > ((d - 1) • \L\/2) + \L\ + (d ■ (\T\ - \L\)/2) > d ■ \T\/2. □ 

Lemma 11. Let n, k and d be integers such that 2 32 < n, logn < k < \fnj%, 
^J< 4 ( mX^) - Then ( k+ k d ) < n. 

Proof. First notice that the expression ( k ~l d ) is increasing in d. Therefore to prove 

the lemma it suffices to set d — 4(i og sfc^iog log ti) j anc ^ snow that ( k ^ d ) < n. For 
this particular choice of d it also holds that d < logn < k. 

To estimate the binomial coefficient we use the fact that for two integers 
integers a, b such that < b < a/2 it holds that (£) < 2 H{b / a ^ a where H(x) = 
— x\ogx — (1 — x) log(l — x) is the binary entropy function. For < x < 1/2, 
one can bound H(x) < —2x log x. 

Since ( k+ k d ) = ( fe + d ) and d < (k + d)/2 we conclude that 



k + d\ _ fk + d> 
k ) V d 



^ < 2 H i^) {k+d) < 2- 2dlog (^). 

From the assumption k < it follows that 

logn 



_4(log8fc — log logn) 

By substituting for d we get 

, fk + d\ oJ1 fk + d 
log < 2d log 



logn 

— 8 (log 8fc — log log n) 



k ) \ d 

<2d\o g ( 2 4) 



d J 

logn / 16fc (log 8fc — log logn 

— ^ • 77; — si \ — \ r • log 



4(log8fc — log logn) \ logn 
log n log 2 + log 8fc — log log n + log (log 8fc — log log n) 
2 log 8fc — log log n 



< logn n log 3 
" 2 V3 3 

< logn. 



In the next to last inequality we use the fact that log 8k — log log n > 3 and 
that is decreasing when x > 3. □ 



Proof of Lemma\]j[ Consider a prefix bucketing a = a , a 1 , 
into k buckets, where log n < k < Let a" = (n, 0, 0, . . . , 0) be a fc-tuple 

of integers and let a' = a , a 1 , ... , a*" 1 , a'*. Clearly, a' is also a prefix bucketing 
and c(a') < c(a) + n — 1. Hence, it suffices to show that c(a') > 8(iog8fc-k>giogn) • 
Let T be T(a')i- By Proposition [U |T| = n, and by Proposition T is fc- 
admissible. Furthermore, by LemmalHJ c(a) = Yli=i C C^(^)«) = C (D- So we only 
need to lower bound c(T). By Lemma [TU] a k- admissible tree has size at most 
where d is its depth. For d < ^Xg^v (^f" 1 ) < < - by 

Lemma HI] so T must be of depth d > 4(log g^g^ log w) . By Lemma [TU1 c(T) > 
n log „ The lemma fo ii ows . D 



8(log8fc-log 
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